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iBSTRlCT 

An informal reading inventory (IRI) consists of 
graded passages of increasing difficulty and a series of 
coaprebension guestions for each grade level reading selection. The 
IRI is lost often used to assist classroos teachers in the placement 
of children in groups for purposes of instruction. li. this paper, the 
history of the IRI is presented and the probles or selecting 
perfornance criteria, validity and reliability, and the value of iBis 
for determining the instructional level of pupils are discussed. 
Selected literature, supporting the contention that lost teachers 
cannot be successful in using the IRI without training in 
construction, adainistration, and interpretation of such an 
instruaent, concludes the paper. (TO) 
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OWNER - 

It is appropriate to begin the discussion of IRIs 
with Betts' initial article on this subject. Betts 
emphasized the use and value of the IRI in assisting 
classroom teachers in the placement of children in groups 
for purposes of instruction.^ 

BettL stated the following advantages of usiofi an 
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IRI: 

1. The teacher is given direct evidence on achievement 
and needs in terms of available instructional 
material. 

2. The teacher is provided with a technique for 
detecting everyday needs in the classroom. 

3. The child is convinced of his needs and sees how 
to improve his skills. 2 

Basically, an IRI consists of a series of graded 
passages, of increasing difficulty, and a series of compre- 
hension questions for each grade level selection to be read. 
Betts has suggested the following criteria for the selection 
and organization process of the passages to be included in 
an IRI. 

First, the material should be within the interests of 
the learner. When it id not possible to live up to 
this criterion, extra motivation must be supplied in 
the tests situation. Second, the material should be 
graded in readability, Thl3?d, the content of the 
material should be the same as, or closely related to, 
that available in the classroom situation.^ 



^ 1 

^ Emmett A. Betts, "Reading Problems at the Inter- 
^ mediate Grade Level", Elementary School Journal . XL (June, 
1940), 737-46. 



S 2 

^ ^Emaett A, Betts, Foundations of Reading Instri^c- 

Vj tion . (New York: American Book Company, 1954), p. 478. 

ErJc hhid. . pp. 457-58. 



BEST COPY RVRllABll 

Since Betts* initial work in the area of IRIs, the 
instrument has found widespread acceptance in the field of 
reading. Several other investigators have attempted to 
determine criteria for interpreting IRIs. The most signif- 
icant studies are presented. 

Killgallon's study of 1942 is highly important. His 
contribution was two-fold: first, he reported that standard- 
ized tests did not discriminate at lower levels of per- 
4 

formance, and, second, he furnished Betts with part of the 
criteria which Betts later adopted for the scoring of his 
IRI. Killgallon examined 41 fourth graders wi^h am informal 
reading instrument smd attempted to establish performance 
criteria for scoring such instruments. The procedure used 
was unorthodox. Criteria were prepared, the subjects 
tested, the performance analyzed, and a new set of criteria 
devised. This procedure yielded the 95 per cent word 
recognition score. It was this criterion score which was 
adopted by Betts. 

Cooper also attempted to establish criteria for 
an IRI* Based on a pre-test/post-test design. Cooper 

Patsy A. Killgallon, "A Study to Determine Relation- 
ships Among Certain Pupils' Adjustments in Language Situations 
(unpublished doctoral dissertation, Pennsylvania State 
College, 1942). 
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compared standardized test scores administered at the 
beginning and end of the experimental period and observed 
that the group which made the fewest errors in word percep- 
tion made the greatest gains in overall reading achievement. 
The converse was also true.^ However, Cooper's sample 
lacked the size and diversity to allow his results to he 
generalized to other populations without a degree of 
caution. 

Cooper's study lends support to the assumption that 
a pupil's reading ability can be fudged, in part, by the 
degree of word perception he has. Thus, IRIs typically 
assess the amount of word recognition errors at each grade 
level as an indication of performance. 

McCracken's study appears to be tne most thoroughly 
conducted investigation on this subject to date. The 
passages of the test were validated with accepted readability 
formulae, and they closely adhered to the vocabulary contents 

of three basal reader series. McCracken's sample was large, 

6 

over 600, and encompassed grades one through six. Although 

^J. Louis Cooper, "The Effect of Adjustment of Basal 
Reading Achievement" (unpublished doctoral dissertation, 
Boston University, 1952). 

^Robert A. McCracken, "The Development and Validation of 
the Standard Reading Inventory for the Individual Appraisal of 
Reading Performance in Grades One through Six, ''Proceedings of 
the International Reading Association . IX (Newark, Delaware: 
International Reading Association, 1964), 310-15. 
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McCracken involved the ratings of twenty-five nationally 
known reading authorities, he did use Betta* criteria, 
partially based on Killgallon's study. Because of this, he 
has had his findings accepted with a degree of caution. 
The major point of contention has always been aimed at the 
acceptance of these performance criteria, axid the subject 
has not yet been laid to rest# 

Performance Criteria of the 
Informal Reading Inventory 

As previously stated, the issue of performance 
criteria has always been unsettled. Interestingly enough, 
no one is actually aware of how Betts* and Killgallon's 
criteria were determined. Beldin asked Betts how they 
were determined and Betts replied by personal correspon- 
dence: 

This study (Killgallon* s) was done after we had been 
using mimeographed materials, etc., for teaching the 
informal inventory. Hence, I would assume that he 
(Killgallon) had used criteria we had originally set up. 



'h.O. Beldin, "Informal Reading Testing: Histori- 
cal Review and Review of the Research", Reading Difficulties ; 
Diagnosis. Correction and Remediation (Newark, Delaware; 
International Reading Association, 1970), p. 75. 



Killgallon's study contains conflicting evidence. 

Killgallon wrote: 

Criteria for defining the probable instructional 
level, the probable frustration level, and the 
prooable reading capacity level were arbitrarily 
established by the investigatorgin connection with 
the Informal Reading Inventory. 

Although the criteria us^d by Killgallon and later 

adopted by Betts appear to be arbitrary decisions, at best, 

these criteria are not at all inconsistent with previously 

Q 10 11 12 

done research by Thorndike, ^» Bolenius, and Durrell. 

Since all theories roust have their beginnings, Killgallon 

is to be commended for having taken such a step. It should 

not be forgotten that these criteria have been fairly well 

validated through use by many of the more prominant 

authorities in the field of reading. 



""Killgallon, op. cit .. p. 9. 

^E. L. Thorndike, "Improving the Ability to Read", 
Teachers College Record . XXXVI (November, 1934), 123-44. 

-^^E. L. Thorndike, "Improving the Ability to Read", 
Teachers College Record . XXXVI (December, 1934), 229-41. 

■^^E. M. Bolenius, Teachers Manual of Silent and 
Oral Reading (Boston: Houghton-Miffiin Co., 1919). 

^^Donald Durrell, "Individual Differences and 
Their Implications with Respect to Instruction in Reading", 
Teaching of Reading: A Second Report . The Thirty-sixth 
Yearbook of the National Society for the Study of Education, 
(Bloomington, Illinois: Public School Publishing Company, 
1937), pp. 325-26. 



Perhaps the most outspoken critic of the Betts- 
Kilipallon perforraance levels has heen Powell, who contended 
that the Betts-Killgallon criteria were unrealistically high 
and lacked the support of empirical data,-*-' Powell 
compiled the performance criteria of various authorities 
and foiind little deviation from the Betts-Killgallon 
criteria. Notable excepl;I6ns to the statement came from 
Smith^^, who proposed a lower percentage of accuracy, and 

Spache, who asserted that Betts' standards were arbitrarily 
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set too high. Powell completed a study in which he 
tested the hypothesis that a suitable criterion for word 
recognition could be lower than the previously accepted 
95 per cent level. He was concerned with two components: 
word recognition atnd comprehension. Holding comprehension 
constant at 70 per cent, Powell scanned the performarce of 
his subjects on word recognition. He submitted the 



^William R. Powell, "Reappraising the Criteria for 
Interpreting Informal Reading Inventories", Proceedings of 
the International Reading_A38Qciatlon . XIII, Part IV (Newark, 
Delaware: International Reading Association, 1969), 100-09, 

■^^Nila Banton, Smith, Graded Selections for Informal 
Reading; Diagnosis for Grades 1 through (New lork; 
University Press, 1959}. 



•^George D. Spache, Reading in the Elementary 
School (Boston: Allyn and Bacon, 1954;. 
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The data suggest that the mean scores of grades one and 
two tend to cluster together '^ercentage-wise, and the 
mean percentages of grades e, four, five, and six, 
form a relatively similar -ntage score* The data 
clearly indicate that pupils in grades one and two 
could tolerate on the average an 85 per cent word 
recognition score and still maintain 75 per cent 
comprehension. To say I foun't this astonishing is an 
understatement: The findings that pupils in grades 
three through six could tolerate on the average a 
91 to 94 per cent word recognition score while 
maintaining 70 per cent comprehension is commensurate 
with the data of Killgallon.io 

Many authors still continue to use various modifi- 
cations of the traditional performance criteria. Ransom 
summarized the performance criteria of several authors to 
illustrate the diversity of opinion on that subject. That 
table is presented on the next page. 

The preceding discussion of performance criteria 
and the chart are presented to illustrate that the IRI is 
a powerful concept which is accepted by reading authorities 
even though a difference of opinion exists as to which 
performance criteria should be accepted. 



Powell, OP. ext .. p. 106, 



TABLE 1 

CRITERIA FOR ESTABLISHING READING LEVELS WHEN 
USING AN INFORMAL READING INVENTORY^'^ 
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Ransom, op. cit « , p# 19. 



Validity and Reliabi lity of the 
InfonaaX Reading Inventory 



The validity and reliability of an IRI is of an 
elusive nature; little has been written on this aspect 
of the IRI until recent years. For many years, the 
validity of the IRI has been assumed. Since the IRI is 
usually composed of selections from actual basal readers 
the pupil is using, his reading perforraance is a valid 
indication of his ability to handle basal material* 
Furthermore, since the questions associated with the 
graded passages are, in fact, similar to actual classroom 
questions, their validity is also assumed. Researxh in recent 
years has attempted to lend support to the assumptions of 
validity with empirical data* An exsusple of such a study 
was done by McCracken, who statistically validated his 
Standardized Reading Inventory * The Inventory contains 
many features found in wost IRIs, Using alternate forms 

of his instrument, he found significant correlations 

18 

among them at the .001 level of confidence. Si pay 
compared two parallel forms of an IRI and found correlations 
of .78 to .82 between them.^^ 



Robert A. McCracken, "The Development and Valida- 
tion of the Standard Reafing Inventory for the Individual 
Appraisal of Reading Performance in Grades One through Six** 
(unpublished doctoral dissertation, Syracuse University, 
1965), pp. 47-68. 

^Edwaird R. Sipay, "A Comparison of Standardized 
Reading Scores and Functional Reading Levels", The Reading 
Teacher XVII (January, 1964), 265-68. 



The question of the reliability of the IRI is, at 
best, a complex problem. The reliability technique of 
test-retest does not lend itself to an IRI. There are two 
reasons for this. The first reason ateas from the probability 
that performance will improve with each additional exposure 
to the material. The second reason rests with the purpose 
of the instrument. An IRI i administered with the intent 
of being able to improve performance with a prescriptive 
reading program based upon the information obtained from 
the instrument. Thus, the instrument would rarely be ^Uven 
in an effort to record a duplicate level of performance. 
The technique of split-half reliability is not suitable 
for use since the values of the test items are unequal. 
It would seem, then, that the reliability of an IRI is 
best measured by comparing performance on this instrument 
with actual reading performance. Hence, the argument that 
the IRI is valid, since the materisas used in the test 
construction are, ideally, the identical material used in 
actual classroom reading situations, that is, basal readers. 
It is further argued that because of this factor, IRls are 
more accurate than standardized tests for purposes of 
determining a pupil's instructional level. The following 
review of the literature would seem to indicate that there 
is a good deal of truth to this argument. 



Comparison of the Informal Reading Inventory 
with Standardized Tests as Determiners 
of Instructional Level 

McCracken compared pupils* scores on an IRI with 
their scores on the Iowa Comprehension Test . He reported 
that: 

Pour pupils achieved 0 to 1 year better on the Iowa 
Comprehension than their informal iii;lnimum instructional 
level; twelve pupils achieved 1 to 2'^years better; 
twenty-eight pupils achieved 2 to 3 years better: nine 
pupils achieved 5 to 4 years better; and three pupils 
achieved 4 to 5 years better. ^0 

On the vocabulary section, McCracken found that: 

Six children achieved informal word recognition levels 
higher than their Iowa vocabulary grade level, four 
by less than one year, and two by one year. Twenty- 
two pupils achieved 0 to 1 year better on the Iowa 
vocabulary than their informal vocabulary level; 
eighteen pupils achieved 1 to 2 years better; nine 
pupils 2 to 3 years better; and 1 pupil achieved 3 to 
4 years better.21 

He summarized: 

The standardized test scores would place 63% (35 pupils) 
of the children at a level of frustration if their 
scores were used to determine the book level needed for 
reading instruction. It would place 95% (52 pupils) of 
the children in a book which is too hard for pupils* 
and teacher comfort, 22 

McCracken concluded that "the average difference 

between che Iowa Reading Comprehension grade levels and 

the informal immediate instructional reading level was 2.3 
2"? 

years'*. 



20 

R. A» McCracken, '♦Standardized Tests and Informl 
Reading Inventories'*, Education . LXXXII (February, 1962), 
367. 

^^Ibid. 
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Wheeler and Smith acknowledge the tendency for 
standardized tests to over-estimate a pupil's instructional 
level; they stated: 

It has long been known by those who work closely with 
children that grade placement scores on reading tests 
for primary grades often have little relationship to 
the child's actual instructional reading levelv In 
the Reading Clinic (University of Miami; we have found 
that pupils who cannot read a primer will often score 
above the second grade reading level on some stand- 
ardized tests. 24 

Betts stated that "standardized tests tend to place 

children at their frustration level". 

In a study similar to McCracken*s, Sipay concluded 

that: 

All three standardized tests Metropolitan Reading T^ at 
(I960), Gates Reading Survey (1958). and the California 
Reading Test (1957) « tended to overestimate tKe 
instructional level by approximately one or more grade 
levels. 26 

And Farr asserted, "The use of standaurdized reading 

tests as indicators of instructional reading level should 

27 

be abandoned" . 

This suggestion is no doubt based upon the fact 
that standardized tests are generally less accurate at the 
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^^Lester R. Wheeler and Edwin H. Smith, "Modification 
of the Informal Reading Inventory", Elementary English . XXXIV 
(April, 1957), 224. 

25 

Emmett A. Betts. Foundations of Reading Instruc- 
tion (New York: American Book company, I9t>7;, pp. 450-51» 

'^''Sipay, op. cit .. p. 267 

27 

Roger Parr, "Reading Tests and Teachers", Educa- 
tional Resources Information Center Document ED 020 082 
(Bethesda, Maryland: ERIC Document Reproduction Service, 
1968), Abstract. 



extremes of performance* In a study in which Betta compared 
the grade equivalent scores of several standr=«rdized tests 
designed for use at the fifth grade level with actual reading 
performance, he concluded that standardized testn are less 
than adeo'iate for determining achievement levels for pupils 
at either end of the distribution of scores,^® Chall also 
pointed out that standardized tests designed for a few 
grade levels gave a poor indication of reading performance 
for the poorest and the most able readers •^^ 

The above is by no means the total of all the 
available research which supports the use of the IRI 
rather than the standardized test for purposes of determining 
instructional level, but the implication is clear: stan- 
dardized tests are not accurate for this purpose. This b^ing 
the case, it seems that the obvious conclusion is for class- 
room teachers to administer an IRI to each pupil on an 
individual basis. This, however, would not be practical 
for a number of reasons, the roost obvious of which is the 
time factor. The testing of an entire class would extend 



Betts, Foundations of Reading Instruction, op , 
cit, , p. 441. 



Jeanne S, Chall, "Interpretation of Results of 
Standardized Reading Tests", Evaluation of Reading . Supple- 
mentary Educational Monographs, No, 88 C Chicago: University 
of Chicago Press, 1958), 133-38. 
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well into the beginning of the school year, and the 
instructional levels of many pupils would not be ascertained 
in time to be helpful to the teacher. This is not, however, 
the only disadvantage. The following discussion will 
introduce opinion as to some additional problems inherent 
in the use of th*? IRI, 

Problems Inherent in the Use of the 
Informal Reading Inventory 

There is reason to believe that the average untrained 
classroom teacher does not have the skills to construct, 
administer, and interpret an IRI. Even if allowances are 
made for the use of commercially available IRIs which have 
been carefully constructed, the tasks of administration and 
interpretation still remain beyond the skills of untrained 
teachers, 

Kender addressed himself to this point and maintained 

that: 

Anyone who is expected to administer an informal 
reading inventory must be thoroughly knowledgeable 
about the reading process and thoroughly skilled in 
administering the instrument, 50 



Joseph P. Kender, "How Useful Are Informal Reading 
Tests?'*, Journal of Reading . XI (February, 1968), 341- 
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Levy supported Kender's contention in her reply to 
his article, she stated that "the informal tests take a 
great deal of time as well as Icnowledge".^^ 

Kender specifically stated that; 

The usefulness, then, of an informal reading test is 
in direct proportion to the knowledge of the examiner 
who uses it; therefore, it is unlikely that just any 
classroom teacher can easily administer an informal 
reading inventory and judiciously interpret its results 
as is sometimes claimed. 32 

Kelley's thoughts on this subject were that: 

^ ven though the informal reading inventory represents 
a most effective instrument for evaluating a reader's 
performance, perhaps its greatest disadvantage relates 
to the competency of the examiner, since the accuracy 
of the data received through its use depends almost 
entirely upon the competence of the examiner. 53 

As a result of her study, Ladd decided that teachers 

were often inaccurate in identifying oral reading errors. 

Ladd's conclusions lend support to previously stated 

opinions which suggested that a high degree of skill is 

needed in the administratioii and interpretation of an 



31 

B. K. Levy, "How Useful Are Informal Reading 
Tests?", Journal of Reading . XII (October, 1968), 38. 

^^Kender, op. cit .. p. 341. 

^\elley, loc. cit. 
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informal reading inventory. She found that even after thirty 
hours of training, teachers in her study still incorrectly 

identified from 53 to 37 per cent of oral reading errors 

•54. 

made by pupils. 

A further comment on this problem was expressed by 

Della-Piana, Jenson, and Murdock who concluded; 

It is our judgment that construction, administration, 
and interpretation of informal reading inventories are 
far too time consuming for the regular classroom 
teacher. She either will not learn the skills, or 
having learned them will not use them in practice 
because of competing demands upon her time* 55 

Thus, the problem of providing classroom teachers 
with a simply constructed, easily administered, and objectively 
scored instrument for determining instructional level has 
not been solved. 

It seems that while an IRI yields an indication of 
instructional level, its major usefulness is as a diagnos- 
tic instrument. By carefully noting, classifying, and 
interpretating oral reading and comprehension errors, the 



54 

Eleanor Mary Ladd, "A Comparison of Two Types of 
Training with Preference to Developing Skill in Diagnostic 
Oral Reading Testing", (unpublished doctoral dissertation, 
Florida State University, 1961). 

35 

■'■^Gabriel Della-Piana, Betty Jenson, and Everett 
Murdock, "New Directions for Informal Reading Assessment", 
Reading Difficulties; Diagnosis. Correction. ?ind Remedia - 
tion (Newark, Delaware: International Heading Association, 
Vm), p. 127. 
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examiner is able to determine specifically the inadequate 

skills which result in poor reading performance. These 

observations should then result in a prescriptive program 

to improve reading performance. In this capacity, the IRI 

should not be discounted nor should it be replaced for 

this purpose. This clarification is offered since the 

major emphasis has been upon the IRI as an indicator of 

instructional level. Powell stated this most appropriately: 

The value of an IRI lies not in its identification of 
what has been called the instructional level (and the 
other levels by interpolation) because there are 
probably more effective and efficient methods of 
accomplishing such tasks. The use of cloze procedure 
is one alternative already available that has a 
considerable body of research data to support it. 
The real value of the IRI is that it affords the ,^ 
possibility of evaluating reading behavior in depth . 

Powell's statement is especially significant due 
to his reference to the cloze procedure as a possible sub- 
stitute technique for determining the instructional levels 
of pupils. 



Summary of the History & Development 
of the Informal Reading Inventory 

The material included in the preceding section 

pertained to the history and development of the IRI. The 



-^"'William R. Powell, •♦The Validity of the Instructional 
Reading Level; Informal Reading Inventory", Proceedings of 
the International Reading Association . XV (Newark, Delaware; 
International Heading Association, 1971), 151. 



problem of selecting performance criteria, validity and 
reliability, and the value of IRIs for determining the 
instructional level of pupils is also discussed. The 
final portion includes selected literature to support the 
contention that the average classroom teacher probably 
could not be successful in using the IRI due to a lack of 
training necessary to enable her to construct, administer, 
and interpret such an instrument. 



